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THE PROBLEM OF MAYER WITH VARIABLE END POINTS* 



GILBERT AMES BLISS 

Let y denote a set of variables (yi, • • • , y n ) each of which is a function of 
x, and let y(xi), y{x 2 ) be symbols for the end values of these functions 
at Xi and x 2 . The latter will also be represented when convenient by the sets 
( 2/ii > • * • > ym ) , ( 2/12 , • • • , y n z ) > respectively. The problem to be considered 
is the determination of the properties of an arc 

(E) y = y(x) (a;i<a;<a5 2 ) 

which satisfies a system of differential equations 

(1) <P*{x,y,y') = ( M = l,2, ■■•,m <n) 
and minimizes the first of a set of functions 

f„(xi,y(xi),x 2 ,y(x 2 )) ( P = l,2, ••.,r<2» + 2) 

while the rest f 2 , • • • , f n are made to vanish. It will be shown that if 
denotes the function 

Q = Xi 0i + • • • + X m <j> m , 

then a necessary condition for a minimum is that there shall exist m functions 
Xi (ar) , • • • , X m (x) not all identically zero on Xi x 2 , satisfying the equationsf 

j 

(2) fl y4 -^0, s , = (*-l,2,...,n) 

and making all determinants of order r + 1 of the matrix 

Wt Mil 3 /p d h 



(3) 



d#i dyia dx% dy k2 

n n 

®{xi)-Yi,% k >(xi)y'ki &V(a;i) -^(X2)+Y l ^ yt >(x i )y' k2 -Q n ,(x 2 ) 
*=i *=i 

vanish. In the rows of the matrix, k = 1 , -• • , n, while in the columns 



* Presented to the Society December 22, 1916, with the title, A device applicable to the 
problems of Lagrange and Mayer in the calculus of variations. 
t The subscripts denote partial derivatives of £2. 
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p = 1 , • • • , r, and the arguments in the derivatives of the functions f p , and 
in Q and its derivatives, are in every case those belonging to E . 

The classical problem of Mayer in the calculus of variations is a special 
case of this when /i = yu and ft , • • - ,f r are the functions 

Xi — cai, Vi\ — ft-i, x 2 — a 2 , yia — fe 

(j = 1, •••,n; k = 2, •■■,n), 

the a's and /3's being constants. In a recent paper Bolza* has formulated a 
problem in which he admits so-called finite equations not involving derivatives 
y' in the system (1), and for which the expression to be minimized, in the 
non-parametric case, is a sum 

U= I f(x,y, y')dx + G(x u y(xi),x 2 , y(x 2 )) 

instead of the function /i . When 6 = this is the Lagrange form of problem, 
and when / = it has the form stated in the first paragraph above. 

It will be shown in a later section that the problem of Mayer with variable 
end points, stated in the first paragraph above, and the problem of Bolza 
are equivalent in the sense that the Lagrange multiplier rule for each is 
deducible from that of the other. Each of them contains all of the classical 
problems of the calculus of variations not involving partial derivatives as 
special cases, with the exception of those of a type exemplified by an example 
proposed by Hadamard.f But the results for the problem which is the prin- 
cipal study of this paper are the most symmetrical. The proofs given below 
are new, an important feature being the use and symmetric treatment of the 
hypothesis that the matrix || dp^/dyi || (ju = 1 , • • • , m; k = 1 , • • • , n) 
is of rank to at every point of E, instead of the usual unsymmetric assumption 
that a particular one of the determinants of order to of the matrix is every- 
where different from zero on E . 

1. Preliminary notions and theorems 

In the following pages it will be understood that the various indices unless 
otherwise expressly stated have the ranges 

i, j,k = 1, ••• ,n; p,v = 1, ••• , m; p,<r = 1 , • • • , r; t = m + 1 , •■• ,n. 

An equation written with one of these indices is supposed to hold for every 
one of its values. 



* tJeber den " anormalen Fall " beim Lagrangeschen und Mayerschen Problem mit gemisch- 
ten Bedingungen und variabeln Endpunkten, Mathematische Annalen, vol. 74 
(1913), p. 403. 

t Lesons sur le calctd des variations, p. 176. 
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The arc E is supposed to have the following properties: 

1. It is of class C"* and such that the functions <p^ , f p are also of class C" 
near the values of their arguments on E . 

2. It satisfies the equations tp^ = fz = • • • = /« = . 

3. The matrix || d<p„./dy' h || has rank m at every point of E. 

4. The matrix 

%l Mil %l Mil 
dxi dy k i dx 2 dy k2 

with 2ra + 2 columns and r rows is of rank r at the values of the arguments of 
the functions / p on £. 

The last two hypotheses are to secure the independence of the differential 
equations and of the functions / p near E . 

It will be shown in Section 3, as a consequence of hypothesis 3 , that n — m 
functions <p T (x, y, y') can always be adjoined to the m functions ^ in such 
a way that the complete set <pi has the properties of Hypothesis 1 and a deter- 
minant | d<pi/dy' k | different from zero everywhere on E. If z represents 
always a set ( 21 , • • • , z„ ) of which the first m elements are identica'ly zero, 
then the arc E determines uniquely the n — m remaining functions in z so that 

<Pk(x,y, y') - z ft = 0. 

This is the extended system of differential equations formed from the system (1) . 
Consider a one-parameter family of arcs 

y*> = yk(x,u), z k = z k (x,u) [x l (u)<x^x 2 (u)] 

which contain E for u = and satisfy ft = • • • = f r = and the extended 
system of differential equations for every u . Its variations are the expressions 

(4) £i = zi„(0), ? 2 = a;2«(0) ) y k (x) = y ku {x, 0), f*(a;) = z ku (x, 0), 

where the subscript u denotes differentiation. It is understood that f as well 
as 2 always represents a set of elements ( fi , • • • , f„ ) in which the first m are 
zero. The variations satisfy the equations of variation 

**(ij) -r*-*i(€,i?) =Frii,-n) = 

where 



^-^ I d<pi d<fi /\ 



(5) 



^«^)-^fc + ?^(if«& + n«)+^6 



+ 5^(y«& + HM), 



* A function is of class C" if it has continuous first and second derivatives. 
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and the functions y, y' occurring explicitly and in the derivatives of <pt and 
/ p are those defining E . 

The existence theorems for linear differential equations justify the following 
lemma: 

Lemma 1. The equations 

(6) *<(,) -r».= o 

determine uniquely a set of solutions rn of class C" in a neighborhood of the 
values xi s= x Si x% , when functions f »• of class C in this neighborhood and the 
initial values of the rfs at a single point are given. 

Consider now a system H of r sets of variations £* , i-% , rf h , f I with f 's of 
class C , ri's of class C" , and satisfying the equations (6). Variations of this 
sort with fi, £ 2 arbitrary constants are called admissible variations. Then 
with the help of the existence theorems for implicit functions and differential 
equations, and denoting the set ( ei , • • • , e r ) by e , it is possible to prove 

Lemma 2. For every system H of variations the equations 

(7) <Pk(x, y,y') = z k (x) + ei{' k + ••■ + e r f£ 
determine a family of solutions Y k (x, e) with initial values 

(8) Y k (xi, e) = y k (xi) + enj*(iCi) + ••• + e r vf k {xi), 
and of class C in e and C" in x near the values 

X\ S=i x Si Xi , ei = • • • = e r = 

for which it contains the arc E . The ^-variations of the family with respect to 
the parameter e„ are r\ k , f k . 

For there is a neighborhood of the values ( x , y , e ) satisfying the conditions 

(x, y) on E, e x = e 2 = • • • = e r = 0, 

in which the equations (7) have a set - of solutions 

y' k = fa(x, y, e) 

of class C",* and if to these equations are adjoined those of the set de r /dx = 0, 
then the existence theorems for differential equations establish the existence of 
a set of functions Y k ( x , e ) with the properties described in the lemma.f Since 
the ^-variations of the family with respect to the parameter e„. satisfy the 
equation (6), since from (8) the initial values of the ij's are the values rf h (xi) , 
and finally since the corresponding f-variations are clearly the functions f £ , 

* Bolza, Vorlesungen Uber Variationsrechnung, p. 166, § 22). See also Bliss, Fundamental 
Existence Theorems, The Princeton Colloquium Lectures, p. 19. 
t Bolza, loc. cit., p. 168, §§ 23, 24. 
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it follows from Lemma 1 that the ijf-variations of the family with respect to 
e„ are the functions t]l , fj as stated. 

2. The Lagrange multiplier rule 

In order to prove finally the result stated in the introduction two further 
lemmas will be useful. 

Lemma 3. If a system H of variations exists such that the determinant 
I Fp ( %" > v" ) I is different from zero, then E can not make fa a minimum. 

For suppose such a system H of variations £f , £2 , rf k , $% and consider the 
functions Yu ( x , e) which they define as described in Lemma 2. If 

X\ = Xl + €i £ + • • • + € r £1, X 2 = Xi + «j £ 2 + • * * + €r &, 

/,(«i, ••■,*) =/,Ui, y(Zi,«),z„ y(x 2 ,e)) 

it is readily seen that for €1 = • • • = e r = the functional determinant 
I df p /de v I is identical with | F f ( i-" , 7? ff ) | and is different from zero. Hence 
the equations 

/1 - u = / 2 = • • • = / r = 

have an initial solution m = /1 ( , , • • • , ) , €1 = 62= • • • = e r = 0, at 
which their functional determinant with respect to ei , • • • , e r is different from 
zero. They have therefore solutions for ei , • • • , e r in terms of u , showing 
that there exist arcs in every neighborhood of E satisfying 

«V = fa = • • • = fr = 

and giving /1 values u both greater and less than the value /1 ( , • • • , ) 
which belongs to E . This proves the lemma. 

Lemma 4. If E makes fa a minimum there must be constants l„ not all zero 
such that the sum 

(9) hF 1 (Z,i,)+---+lrFr(Z,1 l ) 

vanishes for every set of admissible variations satisfying equations (6). 

By Lemma 3 the determinant | F p ( £ " , rf ) | must vanish for every system 
H . Let s be the maximum rank possible for the determinant and suppose H 
selected so that this rank is actually attained. If the constants l a are so 
chosen that the expression (9) vanishes for every variation in H then (9) must 
also vanish for every admissible variation whatsoever. Otherwise the maxi- 
mum rank of the determinant would be greater than s . 

To prove the result described in the introduction let 

f=hfl+ ••• +hfr, 
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(10) 



C^i * dyici 



0#2 * 02/42 



tfifcl = 



dm' 



qki = 






k2 



the end values of the variables in the expressions on the right being always 
those belonging to E . The values of p\ , p 2 , qja , <7*2 can not all be zero since 
in that ease the matrix in Hypothesis 4 of Section 1 would necessarily be of 
rank less than r . From equation (5), Lemma 4, and equation (6) it follows 
that the equations 



(ID 



(12) 



Pi Si +Z?M Vki - V% & - XflW Vki = 0, 

* k 

I Z>* ?* Ar = I Z (^* Vk + SI,,, Vk) dx 

vxi k */xi k 



= pZ(K~^K')dx + ZK' 



Vk 



hold for every set of admissible variations and every set of functions Xj ( x ) 
of class C on x\ x% . But the X's can be chosen satisfying the n differential 
equations with initial conditions 



d n 



(13) V yk - ^ IV, = , A,,, (x 2 )=q k2 . 
The sum of (11) and (12) then gives 

I XXfc ?* *» = Pi Si + Z) [?*1 - ®vA x l)] Vkl - 2>2 £2 
Jx\ k k 

as an equation which must hold for all admissible variations, and therefore 
by Lemma 1 for arbitrarily selected values Si , S2 , Vki , T* • It follows readily 
then with the help of the Fundamental Lemma of the calculus of variations* 
and the initial conditions (13) for the functions X& that 

(14) X T = pi = qn ~ Ay*' (xi) = - p 2 = qki - A^ (x 2 ) = 0. 

It is clear therefore that the last n — m functions X T vanish identically. But 
the first m do not, since the last equations would otherwise make all of the 
constants p\, p 2 , qui, 9*2 vanish. The last four of equations (14) however 
imply that all determinants of order r of the matrix 



(15) 



d*i r dy kl Vh dy kl 







fl tf4 '(«i) 







Ml. 

dy k 2 



1 Bolza, loc. cit., p. 25. 
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vanish since they are obtained by multiplying the columns of this matrix by 
h, • • • > lr, — 1 , adding, and equating the results to zero. The matrix (3) 
is obtained from the one just given by subtracting from the first and 
( n + 2 ) d columns linear combinations of the others, and by introducing in 
the last row the terms Q, ( x\ ) , £2 ( x 2 ) which are zero. This last step is entirely 
artificial and unnecessary, but it turns out that the matrix (3) gives more 
directly the customary results for many special cases. Since the matrix (3) 
is obtainable as just described from the last one, it follows that (3) also must 
have rank less than r + 1. 

The statement just proved that when E minimizes fi there must exist m functions 
Xi , • • • , X m of class C not all identically zero on Xi Xi , satisfying the equations 
(2), and making all determinants of order r + 1 of the matrix (3) vanish, is equiva- 
lent to the statement that there must exist m functions X of class C satisfying the 
equations (2), and r constants h, • • • , l r not all zero satisfying with them the 
equations 
(16) pi = q kl - ^(zi) = p 2 = q k2 - Q,»»(a;j) = 0, 

where the values of pi, qia, P2. qw are defined by the equations (10). 

For the equations (16) are the linear equations in l\, • • • , l r , — 1 whose 
coefficients are the columns of the matrix (15). It is further a consequence 
of the existence theorems for differential equations that the only solutions of 
the equations (2) which can vanish simultaneously are the solutions 

Xi = • • • = X m = . 

Hence a set of solutions not identically zero must have values not all zero at 
Xi and Xi , and consequently defines derivatives Sl n > ( Xi ) , fl yt / ( x% ) not all zero. 
From the fact that the matrix (15) is of rank less than r + 1 it follows readily 
therefore that multipliers h , • ■ • ,l r not all zero exist satisfying equations (16). 
Hence the first statement of the theorem implies the second. The proof that 
the second implies the first is immediate. 

It is important to note that a minimizing arc E is said to be normal for 
the problem under consideration when a system H of variations can be so 
selected that the matrix \\ F p (^" , v") \\ (p = 2, • • • , r : a = 1 , • • • , r ) has 
rank r — 1 ; otherwise it is abnormal. 

For a normal minimizing arc E the constant h is always different from zero, 
and the constants and functions h = 1 , 1% , • • • , l r , Xi , • • • , X m satisfying 
equations (2) and (16) are unique. In the abnormal case there always exists a 
set h, • • • , l r , Xi , • • • ,\ m with h = . 

It is clear that h can not be zero in the normal case since l\, • • • , l r make 
the expression (9) vanish for every set of admissible variations, and since the 
system H can be selected so that the matrix 
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\\F fi W,r,n\\(p = 2, ...,r:cr = l, • • • , r) 

has rank r — 1 . The constants h = 1, h, • • • , l r are for a similar reason 
seen to be unique. Furthermore two systems X, X satisfying equations (2) 
and (16) with these constants would satisfy 

SVOi) = flW(zi)- 

Hence the differences of the corresponding elements of X and X would vanish 
at xi and satisfy equations (2). They would therefore be identically zero. 

The statement of the theorem with regard to the abnormal case is an 
immediate consequence of the way in which the constants h , • • • , l r were 
originally determined. 

3. An auxiliaby theobem 

It was stated in Section 2 that it is always possible to adjoin to the functions 
<P». ( x > y > y' ) a set of functions <p T ( x , y , y') in such a way that the deter- 
minant | d<pi/dy' k | is different from zero along the arc E. The functions <p T 
can in fact be taken in the form 

<p T = a Tl (x)y[ + ••• +a Tn (x)y' n , 

where the functions a Tk are polynomials in xi and x<i. If the derivative 
dVn/dy'k formed for the arc E is denoted by a^ h {x) , this result is an immediate 
consequence of the following theorem : 

Auxiliaby theobem. If a matrix || a^W of continuous functions a llik (x) 
has rank m at every point of the interval xi x% , then n — m additional rows of 
polynomials a Tk (x) can always be selected so that the determinant [ ane \ is 
everywhere different from zero on xi Xi . 

The rank of the matrix 1 1 a^ h \ \ is unchanged if one row is multiplied by a 
continuous function of x and added to another, and a proof of the theorem for 
the matrix so altered implies its validity also for the original one. If the 
first row of || a^ || is multiplied by a suitable factor and added to the second, 
the latter can be made orthogonal to the first, and by repetitions of this 
process every row can be made orthogonal to every other. 

Consider then the linear equations in U\ , • • • , u n whose coefficients are 
the rows of an orthogonal matrix 1 1 a M4 1 1 . If n — m properly selected u's 
are arbitrarily chosen in a neighborhood of x\ as continuous functions of x 
not vanishing simultaneously, the m remaining u's will be uniquely determined 
by the equations, since one at least of the determinants of the matrix is dif- 
ferent from zero at Xi . Let $ then be the upper bound of the values x defining 
intervals Xi x on which the equations have continuous solutions not vanishing 
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simultaneously. There is a neighborhood of £ in which one of the determinants 
of the matrix remains different from zero, and x can be selected in this neigh- 
borhood between Xi and £ . The equations have solutions u as just described 
on the interval Xi x and it is possible to extend the definitions of n — m of 
them so that they are continuous from x\ to £ + h and not simultaneously 
vanishing to the right of x . If the n — m extended u's have been properly 
selected the remaining m will be uniquely determined from x to £ + S by 
the equations, showing that £ can not be the upper bound described unless it 
is at x 2 . Furthermore in the latter case the same argument shows that there 
exist n continuous solutions u not vanishing simultaneously on the interval 

Xi X 2 . 

If the row of functions u so determined is adjoined to the matrix || o Mi || 
the resulting matrix has rank r + 1 at every point of Xi x 2 . For at a point 
where this were not true there would exist m solutions v of the n equations 

Uk = vi an + • • • + v m a mk . 

The sum of these equations multiplied by the elements of the ,uth row of the 
matrix would show that v„. must vanish, since the /*th row is orthogonal to 
the w-row and all the others. The vanishing of all the v's would, however, 
contradict the fact that the u's do not vanish simultaneously. 

It is clear therefore that one row of continuous functions am+1, k can be 
adjoined to the matrix 1 1 a Mft 1 1 in such a way that the new matrix has rank 
m + 1 . Continuation of this process leads to a determinant \ a ik \ which is 
nowhere zero on x\ xi . Furthermore since every continuous function can be 
approximated by a polynomial with any desired degree of accuracy it follows 
readily that the elements of the adjoined n — m rows could be chosen as 
polynomials if desired. 

4. Relations between different types of problems 

For the problem of Bolza in non-parametric form the function to be 
minimized is 

U= I f(x,y,y')dx + G(xi,y(x 1 ) ) X2,y(x 2 )), 

while the equations of condition and end conditions are 

<Pa(x, y, y') =0 (« = 1, ••-,?), 

<Ae (*,y) =0 (/» - l, ---.a), 

Xy(*i> y(xi),x 2 ,y(x 2 )) =0 (7 = 1, •••, r). 
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He assumes that the determinant 



dyi 


dy m 


dyi 


He 
dy m 



(a = 1, •••,?) 

(0-1, •■■-9) 
(m=p-f ?) 



is different from zero and that the end conditions are " reduced and non- 
singular on E." The last means that the matrix of first derivatives 

with respect to their arguments has rank 2q + r at the values of the arguments 
on E. 

When / = and the equations ip f = are absent this is exactly the form 
treated in the present paper, and the results which Bolza states in his Satz II* 
are exactly those of the next to last theorem of Section 2 above. On the 
other hand the problem just given is precisely that of determining an arc 

yo(x),yi(x), ■■■ ,y n (x) 

which satisfies the differential equations 



y'o -/ 



<Pa 



dt 



ty? 



"B + Z^y^o 



dx ' V dy k 
and gives the first of the functions 

2/02 - «/oi + G(xi, y(xi), x 2 , y{x 2 )), ^ (si, y(xi)), 
X y (xi,y(xi), x 2 ,y(x 2 )) 



(Xi 


= £ 


= 


%), 


(« = 


= 1, 




,P) 


(/3 = 


= 1, 




,«) 



((8 = 1, ••-,?) 
(-y = 1, ■■■ ,y) 



a minimum value while making the rest vanish. It is readily seen that this 
has the form treated in the preceding sections, and by applying the results of 
Section 2 it turns out that the various constants and functions for the two 
problems correspond as in the following table : 

(A) h, Zi +3 — Xp+p (xi) , Xp+p (x 2 ) , h + p , h+q+y, X a (a:), — X p+|3 (x) 

(B) la, lp , lp , l e , l q +y , X tt ( X ) , flp ( x ) 

a = l,---,p; P = l,---,q; 7 = 1, •••,?•. 

The second row is the one containing the elements in Bolza's notation which 

appear in his paper. From this table and the next to last theorem of Section 

2 one deduces at once Bolza's Satz II. 

University of Chicago 
September, 1917 



•Mathematische Annalen, loe. cit., p. 438. 



